Bioinformatics and handwriting/speech recognition: unconventional applications of similarity search tools

نویسندگان

  • Kyle Jensen
  • Gregory Stephanopoulos
چکیده

This work introduces two unconventional applications for sequence alignment algorithms outside the domain of bioinformatics: handwriting recognition and speech recognition. In each application we treated data samples, such as the path of a handwritten pen stroke, as a protein sequence and use the FastA sequence alignment tool to classify unknown data samples, such as a written character. That is, we handle the handwriting and speech recognition problems like the protein annotation problem: given a sequence of unknown function, we annotate the sequence via sequence alignment. This approach achieves classification rates of 99.65% and 93.84% for the handwriting and speech recognition respectively. In addition, we provide a framework for applying sequence alignment to a variety of other non–traditional problems.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Off-line Arabic Handwritten Recognition Using a Novel Hybrid HMM-DNN Model

In order to facilitate the entry of data into the computer and its digitalization, automatic recognition of printed texts and manuscripts is one of the considerable aid to many applications. Research on automatic document recognition started decades ago with the recognition of isolated digits and letters, and today, due to advancements in machine learning methods, efforts are being made to iden...

متن کامل

Machine Learning with Templates

New methods are presented for the machine recognition and learning of categories, patterns, and knowledge. A probabilistic machine learning algorithm is described that scales favorably to extremely large datasets, avoids local minima problems, and provides fast learning and recognition speeds. Templates may be created using an evolutionary algorithm described here, constructed with other machin...

متن کامل

Computer Programming in Java

COLUMBIA UNIVERSITY HIGH SCHOOL SCIENCE HONORS PROGRAM 2007Apr 14 Sat Week 10 1. Machine Learning Intro Wikipedia: As a broad subfield of artificial intelligence, machine learning is concerned with the development of algorithms and techniques that allow computers to "learn". Most of machine learning works on extracting rules and patterns out of massive data sets. Some parts of machine learning ...

متن کامل

A Comprehensive Survey on On-line Handwriting Recognition Technology and Its Real Application to the Nepalese Natural Handwriting

Handwriting Recognition Technology has been improving much under the purview of pattern recognition and image processing since a few decades. This paper focuses on the comprehensive survey on on-line handwriting recognition system along with the real application by taking Nepali natural handwriting (a real example of one of the cursive handwritings). The survey mainly includes pre-processing, f...

متن کامل

Improving of Feature Selection in Speech Emotion Recognition Based-on Hybrid Evolutionary Algorithms

One of the important issues in speech emotion recognizing is selecting of appropriate feature sets in order to improve the detection rate and classification accuracy. In last studies researchers tried to select the appropriate features for classification by using the selecting and reducing the space of features methods, such as the Fisher and PCA. In this research, a hybrid evolutionary algorit...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005